Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 7110 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 1075 |
| Duplicate rows (%) | 15.1% |
| Total size in memory | 583.4 KiB |
| Average record size in memory | 84.0 B |
Variable types
| Numeric | 4 |
|---|---|
| Categorical | 8 |
TipoTransaccionID has constant value "11" | Constant |
TipoTransaccionNombre has constant value "Stock Receipt" | Constant |
ClienteID has constant value "0.0" | Constant |
InvoiceID has constant value "0.0" | Constant |
| Dataset has 1075 (15.1%) duplicate rows | Duplicates |
FechaTransaccion has a high cardinality: 1256 distinct values | High cardinality |
TransaccionProductoID is highly correlated with Cantidad | High correlation |
Cantidad is highly correlated with TransaccionProductoID | High correlation |
TransaccionProductoID is highly correlated with Cantidad | High correlation |
Cantidad is highly correlated with TransaccionProductoID | High correlation |
ClienteID is highly correlated with TipoTransaccionID and 5 other fields | High correlation |
TipoTransaccionID is highly correlated with ClienteID and 5 other fields | High correlation |
InvoiceID is highly correlated with ClienteID and 5 other fields | High correlation |
NombreProveedor is highly correlated with ClienteID and 5 other fields | High correlation |
NombreProducto is highly correlated with ClienteID and 5 other fields | High correlation |
TipoTransaccionNombre is highly correlated with ClienteID and 5 other fields | High correlation |
ProveedorID is highly correlated with ClienteID and 5 other fields | High correlation |
TransaccionProductoID is highly correlated with OrdenDeCompraID and 1 other fields | High correlation |
ProductoID is highly correlated with NombreProducto and 3 other fields | High correlation |
NombreProducto is highly correlated with ProductoID and 3 other fields | High correlation |
ProveedorID is highly correlated with ProductoID and 2 other fields | High correlation |
NombreProveedor is highly correlated with ProductoID and 2 other fields | High correlation |
OrdenDeCompraID is highly correlated with TransaccionProductoID and 1 other fields | High correlation |
Cantidad is highly correlated with TransaccionProductoID and 3 other fields | High correlation |
Reproduction
| Analysis started | 2022-06-21 18:24:35.430234 |
|---|---|
| Analysis finished | 2022-06-21 18:26:31.288176 |
| Duration | 1 minute and 55.86 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
| Distinct | 6035 |
|---|---|
| Distinct (%) | 84.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 211952.4606 |
| Minimum | 89146 |
|---|---|
| Maximum | 335846 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 27.9 KiB |
Quantile statistics
| Minimum | 89146 |
|---|---|
| 5-th percentile | 101733.45 |
| Q1 | 151949.25 |
| median | 210032 |
| Q3 | 274549.75 |
| 95-th percentile | 322747 |
| Maximum | 335846 |
| Range | 246700 |
| Interquartile range (IQR) | 122600.5 |
Descriptive statistics
| Standard deviation | 70957.42693 |
|---|---|
| Coefficient of variation (CV) | 0.3347799159 |
| Kurtosis | -1.205484652 |
| Mean | 211952.4606 |
| Median Absolute Deviation (MAD) | 61138.5 |
| Skewness | 0.01969804336 |
| Sum | 1506981995 |
| Variance | 5034956437 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 300347 | 2 | < 0.1% |
| 96402 | 2 | < 0.1% |
| 137660 | 2 | < 0.1% |
| 236835 | 2 | < 0.1% |
| 232716 | 2 | < 0.1% |
| 238719 | 2 | < 0.1% |
| 315723 | 2 | < 0.1% |
| 166951 | 2 | < 0.1% |
| 95285 | 2 | < 0.1% |
| 176431 | 2 | < 0.1% |
| Other values (6025) | 7090 |
| Value | Count | Frequency (%) |
| 89146 | 1 | |
| 89147 | 1 | |
| 89148 | 1 | |
| 89149 | 1 | |
| 89150 | 1 | |
| 89151 | 1 | |
| 89153 | 1 | |
| 89154 | 1 | |
| 89558 | 1 | |
| 89559 | 1 |
| Value | Count | Frequency (%) |
| 335846 | 2 | |
| 335845 | 1 | |
| 335844 | 1 | |
| 335842 | 1 | |
| 335841 | 1 | |
| 335840 | 1 | |
| 335839 | 1 | |
| 335838 | 1 | |
| 335837 | 1 | |
| 335509 | 2 |
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 120.4907173 |
| Minimum | 77 |
|---|---|
| Maximum | 227 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 27.9 KiB |
Quantile statistics
| Minimum | 77 |
|---|---|
| 5-th percentile | 77 |
| Q1 | 80 |
| median | 95 |
| Q3 | 184 |
| 95-th percentile | 204 |
| Maximum | 227 |
| Range | 150 |
| Interquartile range (IQR) | 104 |
Descriptive statistics
| Standard deviation | 51.43712542 |
|---|---|
| Coefficient of variation (CV) | 0.4268969973 |
| Kurtosis | -1.338092278 |
| Mean | 120.4907173 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 0.742631387 |
| Sum | 856689 |
| Variance | 2645.777871 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=17)
| Value | Count | Frequency (%) |
| 78 | 823 | |
| 86 | 817 | |
| 77 | 808 | |
| 204 | 805 | |
| 193 | 804 | |
| 95 | 802 | |
| 98 | 797 | |
| 80 | 785 | |
| 184 | 658 | |
| 222 | 2 | < 0.1% |
| Other values (7) | 9 | 0.1% |
| Value | Count | Frequency (%) |
| 77 | 808 | |
| 78 | 823 | |
| 80 | 785 | |
| 86 | 817 | |
| 95 | 802 | |
| 98 | 797 | |
| 184 | 658 | |
| 193 | 804 | |
| 204 | 805 | |
| 220 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 227 | 1 | < 0.1% |
| 226 | 1 | < 0.1% |
| 225 | 1 | < 0.1% |
| 224 | 2 | < 0.1% |
| 223 | 2 | < 0.1% |
| 222 | 2 | < 0.1% |
| 221 | 1 | < 0.1% |
| 220 | 1 | < 0.1% |
| 204 | 805 | |
| 193 | 804 |
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| "The Gu" red shirt XML tag t-shirt (White) XS | |
|---|---|
| "The Gu" red shirt XML tag t-shirt (White) 5XL | |
| "The Gu" red shirt XML tag t-shirt (White) XXS | |
| Tape dispenser (Red) | |
| Black and orange glass with care despatch tape 48mmx75m | |
| Other values (12) |
Length
| Max length | 55 |
|---|---|
| Median length | 45 |
| Mean length | 42.75893108 |
| Min length | 20 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Black and orange glass with care despatch tape 48mmx75m |
|---|---|
| 2nd row | Black and orange glass with care despatch tape 48mmx75m |
| 3rd row | Black and orange glass with care despatch tape 48mmx75m |
| 4th row | Black and orange glass with care despatch tape 48mmx75m |
| 5th row | Black and orange glass with care despatch tape 48mmx75m |
Common Values
| Value | Count | Frequency (%) |
| "The Gu" red shirt XML tag t-shirt (White) XS | 823 | |
| "The Gu" red shirt XML tag t-shirt (White) 5XL | 817 | |
| "The Gu" red shirt XML tag t-shirt (White) XXS | 808 | |
| Tape dispenser (Red) | 805 | |
| Black and orange glass with care despatch tape 48mmx75m | 804 | |
| "The Gu" red shirt XML tag t-shirt (Black) XL | 802 | |
| "The Gu" red shirt XML tag t-shirt (Black) 4XL | 797 | |
| "The Gu" red shirt XML tag t-shirt (White) M | 785 | |
| Shipping carton (Brown) 305x305x305mm | 658 | |
| Chocolate beetles 250g | 2 | < 0.1% |
| Other values (7) | 9 | 0.1% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| red | 5637 | |
| the | 4832 | 8.7% |
| shirt | 4832 | 8.7% |
| xml | 4832 | 8.7% |
| tag | 4832 | 8.7% |
| t-shirt | 4832 | 8.7% |
| gu | 4832 | 8.7% |
| white | 3235 | 5.8% |
| black | 2403 | 4.3% |
| tape | 1609 | 2.9% |
| Other values (32) | 13934 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| 11 |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 11 |
|---|---|
| 2nd row | 11 |
| 3rd row | 11 |
| 4th row | 11 |
| 5th row | 11 |
Common Values
| Value | Count | Frequency (%) |
| 11 | 7110 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| 11 | 7110 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| Stock Receipt |
|---|
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Stock Receipt |
|---|---|
| 2nd row | Stock Receipt |
| 3rd row | Stock Receipt |
| 4th row | Stock Receipt |
| 5th row | Stock Receipt |
Common Values
| Value | Count | Frequency (%) |
| Stock Receipt | 7110 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| stock | 7110 | |
| receipt | 7110 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| 0.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 7110 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 7110 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| 0.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 7110 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 7110 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| 4.0 | |
|---|---|
| 7.0 | |
| 1.0 | 11 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 7.0 |
|---|---|
| 2nd row | 7.0 |
| 3rd row | 7.0 |
| 4th row | 7.0 |
| 5th row | 7.0 |
Common Values
| Value | Count | Frequency (%) |
| 4.0 | 4832 | |
| 7.0 | 2267 | |
| 1.0 | 11 | 0.2% |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| 4.0 | 4832 | |
| 7.0 | 2267 | |
| 1.0 | 11 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| Fabrikam Inc. | |
|---|---|
| Litware Inc. | |
| A Datum Corporation | 11 |
Length
| Max length | 19 |
|---|---|
| Median length | 13 |
| Mean length | 12.69043601 |
| Min length | 12 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Litware Inc. |
|---|---|
| 2nd row | Litware Inc. |
| 3rd row | Litware Inc. |
| 4th row | Litware Inc. |
| 5th row | Litware Inc. |
Common Values
| Value | Count | Frequency (%) |
| Fabrikam Inc. | 4832 | |
| Litware Inc. | 2267 | |
| A Datum Corporation | 11 | 0.2% |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| inc | 7099 | |
| fabrikam | 4832 | |
| litware | 2267 | 15.9% |
| a | 11 | 0.1% |
| datum | 11 | 0.1% |
| corporation | 11 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 1471 |
|---|---|
| Distinct (%) | 20.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1345.997328 |
| Minimum | 602 |
|---|---|
| Maximum | 2072 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 55.7 KiB |
Quantile statistics
| Minimum | 602 |
|---|---|
| 5-th percentile | 682 |
| Q1 | 986 |
| median | 1347 |
| Q3 | 1710 |
| 95-th percentile | 1998 |
| Maximum | 2072 |
| Range | 1470 |
| Interquartile range (IQR) | 724 |
Descriptive statistics
| Standard deviation | 420.3774096 |
|---|---|
| Coefficient of variation (CV) | 0.3123166748 |
| Kurtosis | -1.182190014 |
| Mean | 1345.997328 |
| Median Absolute Deviation (MAD) | 362 |
| Skewness | -0.01399184079 |
| Sum | 9570041 |
| Variance | 176717.1665 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 829 | 11 | 0.2% |
| 1579 | 10 | 0.1% |
| 1498 | 10 | 0.1% |
| 1186 | 10 | 0.1% |
| 1237 | 10 | 0.1% |
| 1313 | 10 | 0.1% |
| 1043 | 9 | 0.1% |
| 1028 | 9 | 0.1% |
| 1606 | 9 | 0.1% |
| 1764 | 9 | 0.1% |
| Other values (1461) | 7013 |
| Value | Count | Frequency (%) |
| 602 | 6 | |
| 603 | 2 | < 0.1% |
| 604 | 6 | |
| 605 | 3 | |
| 606 | 6 | |
| 607 | 2 | < 0.1% |
| 608 | 6 | |
| 609 | 2 | < 0.1% |
| 610 | 6 | |
| 611 | 3 |
| Value | Count | Frequency (%) |
| 2072 | 4 | |
| 2071 | 6 | |
| 2070 | 4 | |
| 2069 | 3 | |
| 2068 | 6 | |
| 2067 | 3 | |
| 2066 | 7 | |
| 2065 | 3 | |
| 2064 | 7 | |
| 2063 | 5 |
| Distinct | 1256 |
|---|---|
| Distinct (%) | 17.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 55.7 KiB |
| 2015-04-06 07:00:00.0000000 | 20 |
|---|---|
| 2016-01-04 07:00:00.0000000 | 16 |
| 2014-05-19 07:00:00.0000000 | 16 |
| 2015-09-28 07:00:00.0000000 | 16 |
| 2014-12-29 07:00:00.0000000 | 15 |
| Other values (1251) |
Length
| Max length | 27 |
|---|---|
| Median length | 27 |
| Mean length | 21.98171589 |
| Min length | 11 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 44 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | May 30,2016 |
|---|---|
| 2nd row | May 31,2016 |
| 3rd row | May 27,2016 |
| 4th row | May 26,2016 |
| 5th row | May 24,2016 |
Common Values
| Value | Count | Frequency (%) |
| 2015-04-06 07:00:00.0000000 | 20 | 0.3% |
| 2016-01-04 07:00:00.0000000 | 16 | 0.2% |
| 2014-05-19 07:00:00.0000000 | 16 | 0.2% |
| 2015-09-28 07:00:00.0000000 | 16 | 0.2% |
| 2014-12-29 07:00:00.0000000 | 15 | 0.2% |
| 2014-11-03 07:00:00.0000000 | 15 | 0.2% |
| 2015-09-21 07:00:00.0000000 | 15 | 0.2% |
| 2015-02-02 07:00:00.0000000 | 15 | 0.2% |
| 2014-11-24 07:00:00.0000000 | 15 | 0.2% |
| 2015-03-09 07:00:00.0000000 | 15 | 0.2% |
| Other values (1246) | 6952 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| 07:00:00.0000000 | 4880 | |
| may | 235 | 1.7% |
| mar | 229 | 1.6% |
| jan | 226 | 1.6% |
| apr | 223 | 1.6% |
| feb | 213 | 1.5% |
| oct | 170 | 1.2% |
| dec | 169 | 1.2% |
| jun | 165 | 1.2% |
| jul | 156 | 1.1% |
| Other values (728) | 7554 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 3317 |
|---|---|
| Distinct (%) | 46.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21758.43826 |
| Minimum | 10 |
|---|---|
| Maximum | 67368 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 55.7 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 120 |
| Q1 | 12100 |
| median | 19776 |
| Q3 | 30814.5 |
| 95-th percentile | 43728 |
| Maximum | 67368 |
| Range | 67358 |
| Interquartile range (IQR) | 18714.5 |
Descriptive statistics
| Standard deviation | 13565.13099 |
|---|---|
| Coefficient of variation (CV) | 0.6234423091 |
| Kurtosis | 0.2832102274 |
| Mean | 21758.43826 |
| Median Absolute Deviation (MAD) | 8856 |
| Skewness | 0.6138438548 |
| Sum | 154702496 |
| Variance | 184012778.7 |
| Monotonicity | Decreasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 120 | 54 | 0.8% |
| 72 | 41 | 0.6% |
| 48 | 38 | 0.5% |
| 60 | 37 | 0.5% |
| 96 | 35 | 0.5% |
| 36 | 35 | 0.5% |
| 108 | 30 | 0.4% |
| 12 | 29 | 0.4% |
| 84 | 27 | 0.4% |
| 24 | 26 | 0.4% |
| Other values (3307) | 6758 |
| Value | Count | Frequency (%) |
| 10 | 4 | 0.1% |
| 12 | 29 | |
| 20 | 6 | 0.1% |
| 24 | 26 | |
| 30 | 2 | < 0.1% |
| 36 | 35 | |
| 40 | 2 | < 0.1% |
| 48 | 38 | |
| 50 | 8 | 0.1% |
| 60 | 37 |
| Value | Count | Frequency (%) |
| 67368 | 1 | |
| 67272 | 1 | |
| 67200 | 1 | |
| 66840 | 1 | |
| 66744 | 1 | |
| 66696 | 2 | |
| 66480 | 1 | |
| 66288 | 1 | |
| 65904 | 1 | |
| 65760 | 2 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| TransaccionProductoID | ProductoID | NombreProducto | TipoTransaccionID | TipoTransaccionNombre | ClienteID | InvoiceID | ProveedorID | NombreProveedor | OrdenDeCompraID | FechaTransaccion | Cantidad | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 335504 | 193 | Black and orange glass with care despatch tape 48mmx75m | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 2069.0 | May 30,2016 | 67368.0 |
| 1 | 335845 | 193 | Black and orange glass with care despatch tape 48mmx75m | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 2072.0 | May 31,2016 | 67272.0 |
| 2 | 334872 | 193 | Black and orange glass with care despatch tape 48mmx75m | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 2067.0 | May 27,2016 | 67200.0 |
| 3 | 334385 | 193 | Black and orange glass with care despatch tape 48mmx75m | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 2065.0 | May 26,2016 | 66840.0 |
| 4 | 333714 | 193 | Black and orange glass with care despatch tape 48mmx75m | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 2061.0 | May 24,2016 | 66744.0 |
| 5 | 334073 | 193 | Black and orange glass with care despatch tape 48mmx75m | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 2063.0 | 2016-05-25 07:00:00.0000000 | 66696.0 |
| 6 | 334073 | 193 | Black and orange glass with care despatch tape 48mmx75m | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 2063.0 | 2016-05-25 07:00:00.0000000 | 66696.0 |
| 7 | 333459 | 193 | Black and orange glass with care despatch tape 48mmx75m | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 2057.0 | May 23,2016 | 66480.0 |
| 8 | 332869 | 193 | Black and orange glass with care despatch tape 48mmx75m | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 2055.0 | 2016-05-20 07:00:00.0000000 | 66288.0 |
| 9 | 332332 | 193 | Black and orange glass with care despatch tape 48mmx75m | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 2053.0 | May 19,2016 | 65904.0 |
Last rows
| TransaccionProductoID | ProductoID | NombreProducto | TipoTransaccionID | TipoTransaccionNombre | ClienteID | InvoiceID | ProveedorID | NombreProveedor | OrdenDeCompraID | FechaTransaccion | Cantidad | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7100 | 258785 | 77 | "The Gu" red shirt XML tag t-shirt (White) XXS | 11 | Stock Receipt | 0.0 | 0.0 | 4.0 | Fabrikam Inc. | 1622.0 | 2015-09-07 07:00:00.0000000 | 12.0 |
| 7101 | 320530 | 77 | "The Gu" red shirt XML tag t-shirt (White) XXS | 11 | Stock Receipt | 0.0 | 0.0 | 4.0 | Fabrikam Inc. | 1988.0 | 2016-04-11 07:00:00.0000000 | 12.0 |
| 7102 | 333462 | 77 | "The Gu" red shirt XML tag t-shirt (White) XXS | 11 | Stock Receipt | 0.0 | 0.0 | 4.0 | Fabrikam Inc. | 2058.0 | 2016-05-23 07:00:00.0000000 | 12.0 |
| 7103 | 265276 | 77 | "The Gu" red shirt XML tag t-shirt (White) XXS | 11 | Stock Receipt | 0.0 | 0.0 | 4.0 | Fabrikam Inc. | 1657.0 | 2015-09-28 07:00:00.0000000 | 12.0 |
| 7104 | 265276 | 77 | "The Gu" red shirt XML tag t-shirt (White) XXS | 11 | Stock Receipt | 0.0 | 0.0 | 4.0 | Fabrikam Inc. | 1657.0 | 2015-09-28 07:00:00.0000000 | 12.0 |
| 7105 | 333462 | 77 | "The Gu" red shirt XML tag t-shirt (White) XXS | 11 | Stock Receipt | 0.0 | 0.0 | 4.0 | Fabrikam Inc. | 2058.0 | 2016-05-23 07:00:00.0000000 | 12.0 |
| 7106 | 279657 | 204 | Tape dispenser (Red) | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 1741.0 | Nov 16,2015 | 10.0 |
| 7107 | 170642 | 204 | Tape dispenser (Red) | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 1108.0 | 2014-11-03 07:00:00.0000000 | 10.0 |
| 7108 | 174625 | 204 | Tape dispenser (Red) | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 1132.0 | 2014-11-17 07:00:00.0000000 | 10.0 |
| 7109 | 194968 | 204 | Tape dispenser (Red) | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 1260.0 | 2015-02-02 07:00:00.0000000 | 10.0 |
Most frequently occurring
| TransaccionProductoID | ProductoID | NombreProducto | TipoTransaccionID | TipoTransaccionNombre | ClienteID | InvoiceID | ProveedorID | NombreProveedor | OrdenDeCompraID | FechaTransaccion | Cantidad | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 89565 | 193 | Black and orange glass with care despatch tape 48mmx75m | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 605.0 | 2014-01-01 07:00:00.0000000 | 10200.0 | 2 |
| 1 | 90854 | 204 | Tape dispenser (Red) | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 611.0 | 2014-01-06 07:00:00.0000000 | 5270.0 | 2 |
| 2 | 91124 | 77 | "The Gu" red shirt XML tag t-shirt (White) XXS | 11 | Stock Receipt | 0.0 | 0.0 | 4.0 | Fabrikam Inc. | 613.0 | 2014-01-07 07:00:00.0000000 | 10620.0 | 2 |
| 3 | 91131 | 193 | Black and orange glass with care despatch tape 48mmx75m | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 614.0 | 2014-01-07 07:00:00.0000000 | 10368.0 | 2 |
| 4 | 91132 | 204 | Tape dispenser (Red) | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 614.0 | 2014-01-07 07:00:00.0000000 | 5230.0 | 2 |
| 5 | 91409 | 204 | Tape dispenser (Red) | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 616.0 | 2014-01-08 07:00:00.0000000 | 5300.0 | 2 |
| 6 | 91617 | 78 | "The Gu" red shirt XML tag t-shirt (White) XS | 11 | Stock Receipt | 0.0 | 0.0 | 4.0 | Fabrikam Inc. | 617.0 | 2014-01-09 07:00:00.0000000 | 12348.0 | 2 |
| 7 | 91620 | 95 | "The Gu" red shirt XML tag t-shirt (Black) XL | 11 | Stock Receipt | 0.0 | 0.0 | 4.0 | Fabrikam Inc. | 617.0 | 2014-01-09 07:00:00.0000000 | 6348.0 | 2 |
| 8 | 91623 | 193 | Black and orange glass with care despatch tape 48mmx75m | 11 | Stock Receipt | 0.0 | 0.0 | 7.0 | Litware Inc. | 618.0 | 2014-01-09 07:00:00.0000000 | 10296.0 | 2 |
| 9 | 91814 | 98 | "The Gu" red shirt XML tag t-shirt (Black) 4XL | 11 | Stock Receipt | 0.0 | 0.0 | 4.0 | Fabrikam Inc. | 619.0 | 2014-01-10 07:00:00.0000000 | 12804.0 | 2 |